Abstract. We consider the proof complexity of the minimal complete fragment, KS, of
standard deep inference systems for propositional logic. To examine the size of proofs we
employ atomic flows, diagrams that trace structural changes through a proof but ignore
logical information. As results we obtain a polynomial simulation of versions of Resolution,
along with some extensions. We also show that these systems, as well as bounded-depth
Frege systems, cannot polynomially simulate KS, by giving polynomial-size proofs of certain
variants of the propositional pigeonhole principle in KS.


1. Introduction 

Deep inference is a relatively recent proof methodology whose systems differ from other
formalisms by allowing derivations themselves to be composed by logical connectives. One
of its main features is locality, i.e. inference steps can be checked in constant time, a property
that is impossible to achieve in Gentzen systems [Brü03]. In recent years there has been an
increasing interest in the proof complexity of deep inference [BG09] [Jeř09] [Str12] [BGGP10]
[Das11], in particular the weaker systems initially introduced by Brünnler and Tiu
[BT01]. Perhaps the most notable result is that a certain system, denoted KS ∪ {c↑},
quasipolynomially¹ simulates Frege systems [Jeř09] [BGGP10]. It is conjectured that this
can be improved to a polynomial simulation, and so proving nontrivial lower bounds for
KS ∪ {c↑} is likely equivalent to proving them for Frege systems, a task which has escaped
proof complexity theorists for years.

However this quasipolynomial simulation relies crucially on the presence of “dag-like
behaviour”, manifested in deep inference by a particular rule, cocontraction: $\mathsf{c}{\uparrow}\,\frac{A}{A \wedge A}$.
Without it we have a minimal complete system closed under deep inference, KS. This system
is free of compression mechanisms, in the sense that a proof of a conjunction can be
‘partitioned’ into proofs of each conjunct, unlike proofs in systems that are dag-like or contain
cut. This is explained further in [Das12a].

2012 ACM CCS: [Theory of computation]: Logic—Proof theory. 
¹ A quasipolynomial in n is a function of the form $2^{(\log n)^{O(1)}}$.

It is conjectured that KS is unable to polynomially simulate KS ∪ {c↑} [BG09] [BGGP10]
[Das11] [Str12], raising the question of where exactly it fits in the hierarchy of proof systems.

Atomic flows are diagrams that track structural changes in a proof (duplication, creation
and destruction of atoms) but ignore logical information. In this work they serve as
a useful abstraction because of certain rewriting procedures on them which can be used to
manipulate derivations soundly without any mention of logical syntax. Atomic flows are
introduced formally in [GG08] and a comprehensive account can be found in [Gun09].

In this paper we focus on upper bounds and simulations to demonstrate the relative
strength of KS. The starting points in our arguments are proofs in a system obtained by
extending KS ∪ {c↑} by the coweakening rule: $\mathsf{w}{\uparrow}\,\frac{A}{\top}$; the resulting system KS ∪ {c↑, w↑} is
denoted KS⁺ in this paper. We then appeal to sound rewriting rules on the atomic flows
of these proofs to show that cocontraction and coweakening steps can, in certain cases, be
eliminated from a proof in polynomial time.

It is worth mentioning here that the addition of coweakening makes little difference to
proof complexity; indeed it is not difficult to see from the rewriting rules in Fig. 1 that
coweakening steps can be eliminated in time linear in the size of the proof. Rather, the
real generators of complexity in our proofs are the interactions between contraction and
cocontraction nodes in the atomic flows, as we show in Prop. 3.10 and Lemma 3.13.

In Sect. 4 we give a simple example of how atomic flows can be used to normalise a
naive encoding of truth table proofs in KS⁺ to produce a polynomial simulation in KS. As
a corollary we obtain a superpolynomial separation of KS from tree-like cut-free Gentzen
systems, since they are unable to simulate truth tables [D’A92], a new proof of a result
appearing in [BG09].

In Sect. 5 we consider stronger systems; we improve a result of Jeřábek’s that KS⁺ has
polynomial-size proofs of the functional and onto versions of the propositional pigeonhole
principle by showing that they can be polynomially transformed into KS proofs of the same
conclusions. This immediately entails that cut-free Gentzen systems, Resolution and even
bounded-depth Frege systems are exponentially separated from KS by the results of [PBI93]
and [KPW95].

In Sect. 6 we consider simulations in KS of other proof systems. There is already a
naive simulation of tree-like cut-free Gentzen sequent calculi, appearing in [BG09], and what
amounts to a polynomial simulation of ‘Resolution with multisets’ is outlined in [Gug03].
Here we formalise the latter result and also give a polynomial simulation of tree-like Resolution
systems, even when sets are the basic data structure. We show that both simulations
extend to so-called Resolution(k) systems, introduced by Krajíček in [Kra01] and known to
be strictly stronger than usual versions of Resolution [SBI02] [EGM04].

This paper is a full version of [Das12b], and differs from that work as follows:

(1) Full proofs are given where they were brief or omitted previously.

(2) We expand on some of the preliminary work on the complexity of normalisation induced
by flow rewriting in Sect. 2. In particular we provide full proofs of termination and
confluence for the rewriting system norm and give explicit reduction strategies that
achieve the complexity bounds given in the previous work.

(3) In Sect. 2 we use the definitions and notations found in [GG08] and [Gun09] for atomic
flows, to maintain consistency with the existing literature, whereas in [Das12b]
self-contained definitions were preferred for brevity.

(4) There were some errors in the statements of results in the previous work, which have
been corrected here. In particular the previously stated simulations of dag-like cut-free
Gentzen systems and dag-like Resolution systems are incorrect as presented and we
have not been able to amend them. Here we only obtain such simulations for certain
cases, namely the versions of Resolution in Sect. 6.


2. Deep inference, atomic flows and normalisation 

In this section we introduce deep inference systems for propositional logic and atomic flows , 
diagrams that trace the structural changes in a proof. We consider a graph rewriting system 
norm on flows, corresponding to sound manipulations on proofs, and analyse the complexity 
of termination in this system. 


2.1. Deep inference. We consider propositional logic with formulae constructed from literals
(propositional variables and their duals), also called atoms, over the basis {⊤, ⊥, ∧, ∨},
and use the infix symbol = to denote equivalence of expressions. The variables a, b, c, d
range over literals, with $\bar{a}, \bar{b}, \dots$ denoting their duals, and A, B, C, D range over formulae;
both sets of variables may include subscripts or superscripts as necessary.

For clarity we use square brackets [, ] for disjunctions and round ones (, ) for conjunctions.
We generally omit external brackets of an expression, and also internal ones under
associativity. This does not cause any confusion when it comes to proofs, since any valid
bracketing can be reduced to any other by the = rule in Dfn. 2.1.

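Note that we do not have a symbol for negation in our language; formulae are always
in negation normal form. We may however write $\bar{A}$ to denote the De Morgan dual of a
formula $A$, obtained by the following rules:

$$\bar{\bot} = \top, \qquad \bar{\top} = \bot, \qquad \bar{\bar{a}} = a, \qquad \overline{A \wedge B} = \bar{A} \vee \bar{B}, \qquad \overline{A \vee B} = \bar{A} \wedge \bar{B}$$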
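
The dual is computed by a straightforward structural recursion. The following is a minimal sketch, assuming a hypothetical encoding of formulae as Python dataclasses (the names `Atom`, `Unit`, `Bin` and `dual` are ours and are not part of the formal development):

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Atom:
    name: str
    negated: bool = False  # True stands for the dual literal


@dataclass(frozen=True)
class Unit:
    top: bool  # True is the unit "top", False is the unit "bottom"


@dataclass(frozen=True)
class Bin:
    op: str  # "and" or "or"
    left: "Formula"
    right: "Formula"


Formula = Union[Atom, Unit, Bin]


def dual(f: Formula) -> Formula:
    """De Morgan dual of a formula in negation normal form."""
    if isinstance(f, Atom):
        return Atom(f.name, not f.negated)  # literals swap with their duals
    if isinstance(f, Unit):
        return Unit(not f.top)  # top and bottom swap
    flipped = "or" if f.op == "and" else "and"
    return Bin(flipped, dual(f.left), dual(f.right))


a, b = Atom("a"), Atom("b")
example = Bin("and", a, Bin("or", b, Unit(True)))  # a and [b or top]
```

Dualising `example` twice returns the original formula, reflecting that the rules above define an involution.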


Definition 2.1 (Rules and systems). An inference rule is a sound binary relation on formulae
decidable in polynomial time, and a system is a set of rules. We define the rules
we use below, and the systems KS = {ai↓, aw↓, ac↓, s, m}, KS⁺ = KS ∪ {aw↑, ac↑} and
SKS = KS⁺ ∪ {ai↑}.


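Atomic structural rules

$$
\mathsf{ai}{\downarrow}\,\frac{\top}{a \vee \bar{a}}\ \text{(identity)}
\qquad
\mathsf{aw}{\downarrow}\,\frac{\bot}{a}\ \text{(weakening)}
\qquad
\mathsf{ac}{\downarrow}\,\frac{a \vee a}{a}\ \text{(contraction)}
$$

$$
\mathsf{ai}{\uparrow}\,\frac{a \wedge \bar{a}}{\bot}\ \text{(cut)}
\qquad
\mathsf{aw}{\uparrow}\,\frac{a}{\top}\ \text{(coweakening)}
\qquad
\mathsf{ac}{\uparrow}\,\frac{a}{a \wedge a}\ \text{(cocontraction)}
$$

Linear logical rules

$$
\mathsf{s}\,\frac{A \wedge [B \vee C]}{(A \wedge B) \vee C}\ \text{(switch)}
\qquad
\mathsf{m}\,\frac{(A \wedge B) \vee (C \wedge D)}{[A \vee C] \wedge [B \vee D]}\ \text{(medial)}
$$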


Note in particular our distinction between variables for literals and formulae in the 
above rules, and between ‘structural’ and ‘logical’ rules. 

We also have the logical rule = which is obtained by closing the equations below under 
reflexivity, symmetry, transitivity and by applying context closure. We implicitly assume 
that it is contained in every system.

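$$
\begin{array}{lll}
\text{Commutativity} & \text{Associativity} & \text{Units} \\
A \vee B = B \vee A & [A \vee B] \vee C = A \vee [B \vee C] & A \vee \bot = A \\
A \wedge B = B \wedge A & (A \wedge B) \wedge C = A \wedge (B \wedge C) & A \wedge \top = A \\
 & & \top \vee \top = \top \\
 & & \bot \wedge \bot = \bot
\end{array}
$$

Context closure: if $A = B$ and $\star \in \{\wedge, \vee\}$, then $C \star A = C \star B$.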


A proof that = is decidable in polynomial time can be found in [BG09]. Essentially both
formulae are just reduced to some canonical form and then compared. Consequently we
often omit occurrences of the = rule in proofs and derivations.
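
To illustrate the idea of comparing canonical forms, the following is a rough sketch under our own assumptions: formulae are encoded as nested tuples, and the particular normalisation (flatten, simplify units, sort) is chosen for illustration only. It is not claimed to be the procedure of [BG09].

```python
# Hypothetical encoding of formulae as nested tuples:
#   ("lit", name, negated), ("top",), ("bot",), ("and", l, r), ("or", l, r)

NEUTRAL = {"and": ("top",), "or": ("bot",)}     # A and top = A,  A or bot = A
COLLAPSING = {"and": ("bot",), "or": ("top",)}  # bot and bot = bot,  top or top = top


def canon(f):
    """A canonical representative of f modulo the equations for the = rule."""
    op = f[0]
    if op not in ("and", "or"):
        return f
    # Associativity: flatten nested occurrences of the same connective.
    items = []
    for sub in (canon(f[1]), canon(f[2])):
        if sub[0] == op:
            items.extend(sub[1:])
        else:
            items.append(sub)
    # Units: drop neutral units and keep at most one copy of the other unit.
    items = [x for x in items if x != NEUTRAL[op]]
    other = COLLAPSING[op]
    if items.count(other) > 1:
        items = [x for x in items if x != other] + [other]
    if not items:
        return NEUTRAL[op]
    if len(items) == 1:
        return items[0]
    # Commutativity: put the arguments into a fixed order.
    return (op,) + tuple(sorted(items, key=repr))


def equivalent(f, g):
    return canon(f) == canon(g)


a, b, c = ("lit", "a", False), ("lit", "b", False), ("lit", "c", False)
```

Note that, in line with the equations above, `("or", a, ("top",))` is not identified with `("top",)`: only the unit equations listed are applied.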

Definition 2.2 (Proofs and derivations). We define derivations and premiss and conclusion
functions (pr and cn respectively) inductively.

(1) Each formula A is a derivation with premiss and conclusion A.

(2) If Φ and Ψ are derivations and ⋆ ∈ {∧, ∨} then (Φ ⋆ Ψ) is a derivation with premiss
pr(Φ) ⋆ pr(Ψ) and conclusion cn(Φ) ⋆ cn(Ψ).

(3) If Φ and Ψ are derivations and $\rho\,\frac{\mathrm{cn}(\Phi)}{\mathrm{pr}(\Psi)}$ is an instance of some inference rule ρ then
$\rho\,\dfrac{\Phi}{\Psi}$ is a derivation with premiss pr(Φ) and conclusion cn(Ψ).

If pr(Φ) = ⊤ then we call Φ a proof. If Φ is a derivation where all inference steps are
instances of rules in a system S with premiss A, conclusion B, we write
$\begin{array}{c} A \\ \Phi\,\|\,\mathcal{S} \\ B \end{array}$. Furthermore,
if A = ⊤, i.e. Φ is a proof in a system S, we write
$\begin{array}{c} \Phi\,\|\,\mathcal{S} \\ B \end{array}$.

While our structural rules only have atoms in their premisses and conclusions, the 
notion of derivation above allows us to extend these to arbitrary formulae, as stated in the 
proposition below. We often use these ‘generic rules’ rather than their full derivations for 
convenience. 


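Proposition 2.3 (Generic rules). Each rule below is derivable from s, m, and its respective
atomic structural rule in polynomial time.

$$
\mathsf{i}{\downarrow}\,\frac{\top}{A \vee \bar{A}}
\qquad
\mathsf{w}{\downarrow}\,\frac{\bot}{A}
\qquad
\mathsf{c}{\downarrow}\,\frac{A \vee A}{A}
$$

$$
\mathsf{i}{\uparrow}\,\frac{A \wedge \bar{A}}{\bot}
\qquad
\mathsf{w}{\uparrow}\,\frac{A}{\top}
\qquad
\mathsf{c}{\uparrow}\,\frac{A}{A \wedge A}
$$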


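Proof. See [BT01] for full proofs. Here we just give an example of the case for contraction,
since that is the only structural rule of the sequent calculus that cannot be reduced to
atomic form [Brü03]. The proof is by induction on the depth of the conclusion of a c↓ step.

$$
\mathsf{c}{\downarrow}\,\frac{[A \vee B] \vee [A \vee B]}{A \vee B}
\quad\to\quad
=\,\dfrac{[A \vee B] \vee [A \vee B]}{\mathsf{c}{\downarrow}\,\dfrac{A \vee A}{A} \;\vee\; \mathsf{c}{\downarrow}\,\dfrac{B \vee B}{B}}
$$

$$
\mathsf{c}{\downarrow}\,\frac{(A \wedge B) \vee (A \wedge B)}{A \wedge B}
\quad\to\quad
\mathsf{m}\,\dfrac{(A \wedge B) \vee (A \wedge B)}{\mathsf{c}{\downarrow}\,\dfrac{A \vee A}{A} \;\wedge\; \mathsf{c}{\downarrow}\,\dfrac{B \vee B}{B}}
$$

Note that the case for cocontraction is dual to this: one can just flip the derivations upside
down and replace every formula with its De Morgan dual. □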

Definition 2.4 (Complexity). We define the size |Φ| of a derivation Φ to be the number of
atom occurrences in Φ. A system S polynomially simulates a system T if each proof in T
can be polynomially transformed into a proof in S of the same conclusion.

2.2. Atomic flows. We give only an informal definition of atomic flows here, but refer the
reader to [GG08], [Gun09] for a formal account.

Definition 2.5 (Atomic flows). For an SKS derivation Φ we define its atomic flow, fl(Φ),
to be the diagram obtained by tracing the path of each atom through the derivation,
designating the creation, duplication and destruction of atoms by the following corresponding
nodes:

[Figure: the six node types, one for each of the atomic structural rules ai↓, ai↑, aw↓, ac↓, aw↑ and ac↑.]


We do not have nodes for s, m or = since they do not create, destroy or duplicate any atom 
occurrences. 

More generally an atomic flow, not necessarily of a derivation, is a (vertically) directed 
graph embedded in the plane generated from the six types of node above. 

Atomic flows are considered equivalent up to continuous deformation preserving the 
(vertical) ordering of connected edges. Note that edges may be pending at either end. 

We define the size of a flow φ, denoted |φ|, to be its number of edges.
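
For readers who prefer a concrete data structure, a flow can be sketched as a set of typed nodes together with a list of downward-directed edges. The encoding below is a hypothetical one of our own (node types as strings, edges as pairs of upper and lower node, with `None` marking a pending end); it records the size of a flow and checks that every node has the number of upper and lower edges dictated by its rule.

```python
from collections import Counter

# (in-degree, out-degree) of each node type; edges are directed downwards.
ARITY = {
    "ai_down": (0, 2),  # identity creates an atom and its dual
    "ai_up": (2, 0),    # cut destroys an atom and its dual
    "aw_down": (0, 1),  # weakening creates an atom
    "aw_up": (1, 0),    # coweakening destroys an atom
    "ac_down": (2, 1),  # contraction merges two occurrences
    "ac_up": (1, 2),    # cocontraction duplicates an occurrence
}


def flow_size(edges):
    """Size of a flow: its number of edges, pending ones included."""
    return len(edges)


def well_formed(nodes, edges):
    """nodes: dict id -> type; edges: list of (upper, lower), None = pending end."""
    indeg, outdeg = Counter(), Counter()
    for upper, lower in edges:
        if upper is not None:
            outdeg[upper] += 1
        if lower is not None:
            indeg[lower] += 1
    return all((indeg[v], outdeg[v]) == ARITY[kind] for v, kind in nodes.items())


# An identity whose left atom is duplicated by a cocontraction.
nodes = {"i": "ai_down", "c": "ac_up"}
edges = [("i", "c"), ("i", None), ("c", None), ("c", None)]
```

This sketch ignores the planar embedding and the vertical ordering of edges, which the definition above does take into account.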

In previous works atomic flows have been equipped with a labeling of the edges, or a 
polarity assignment, for example to avoid the following impossible situation: 


Since we are only concerned with the complexity of flows and their transformations we do 
not include this extra structure; this does not affect the soundness or termination of our 
rewriting systems, and in fact is crucial in order to obtain confluence, for which labellings 
of edges can cause problems. 

We do, however, insist that edges are vertically directed and so we are often able to 
talk about one node being ‘above’ another node. Notice that this order is not generally 
preserved under deformation, e.g. if two nodes are in disconnected components. Whenever 
we use this notion in arguments it should be clear that it is being used correctly. 


Definition 2.6. A flow rewriting rule is an ordered pair of flows, written φ → ψ. A flow
rewriting system (FRS) is a set of flow rewriting rules. A one-step reduction of a flow φ in
an FRS r is a flow ψ that is obtainable from φ by replacing some induced subgraph that is
the left hand side of some rule in r with its right hand side.

Definition 2.7. We define a graph rewriting system norm on flows in Fig. 1.

The system norm is essentially the system c ∪ w in [GG08], without the rules for i↑. The
proof of termination that follows is similar to that for cycle-free flows in [GG08], and the
proof of confluence is similar to that in [GGS10].

[Figure: diagrams of the rules w↓-c↓, i↓-w↑, w↓-c↑, w↓-w↑, c↓-w↑, c↑-w↑, c↓-c↑ and i↓-c↑.]


Figure 1: Local rewriting rules for the system norm. 


Notation 2.8. Let r be a flow rewriting system. We use the following notation:

• We write $\phi \to_r \psi$ if there is a one-step reduction from a flow φ to a flow ψ using a rule in r.

• We denote by $\to_r^*$ the reflexive transitive closure of $\to_r$.

• If a flow φ has a unique normal form under r we denote it by $\phi{\downarrow}_r$.

In all cases we might omit the subscript r if it is clear from context.

Example 2.9. We give an example of a flow associated with a derivation in Fig. 2, as well
as a reduction under norm, as defined in Dfn. 2.7, applying w↑ rules first.

The first equality follows by the definition of a flow, the second by deformation and the
final by definition again. The intermediate steps are as follows:

(1) Apply c↓-w↑ on the left and c↓-c↑ on the right.

(2) Apply w↓-w↑ on the left, i↓-w↑ in the middle and i↓-c↑ on the right.

(3) Apply w↓-c↑ on the left.

(4) Apply w↓-c↓ twice on the left.

The use of colours in the initial and final flow identifies which edges correspond to
which atoms; in the intermediate flows the colours should aid the reader in reconstructing
the corresponding transformations on the derivation.

We now proceed to prove that reducing under norm is terminating and confluent. For 
this we use the usual critical pair lemma, that one only needs to check that the one-step 
contracta of every overlapping pair of redexes are joinable in order to conclude local conflu¬ 
ence. It is simple to see that this is still valid in the flow rewriting setting, since the only 
other case to check for flows is when the redexes are disjoint, which is trivial. We point out 
that Newman’s lemma, that any locally confluent terminating system is confluent, is true 
more generally for any binary relation on any set, and so indeed holds in this setting. 

Theorem 2.10. $\to_{\mathsf{norm}}$ is terminating and confluent.

Note that we are using downward arrows in both the names of deep inference rules and to denote normal 
forms under rewriting systems. Unfortunately both notations are standard in their respective literature, 
however there should be no ambiguity in their usage so hopefully this will cause little confusion. 

[Figure 2: an SKS derivation, its atomic flow, and a reduction of this flow under norm.]


Proof. For a node ν in a flow φ, let d(ν, φ) be the distance of ν from the top of φ, i.e. the
minimum length of a (vertically) directed path from ν to an ai↓ node, an aw↓ node or an
edge with upper end pending. For an atomic structural rule ρ, let D(ρ, φ) be the sequence
of natural numbers that counts how many ρ nodes in φ have each d-value, i.e. the sequence
$(n_i)$ such that, for each i, $n_i$ is the number of ρ nodes ν in φ with d(ν, φ) = i, and consider
the lexicographical ordering < on such sequences, with $(m_i) < (n_i)$ just if $m_k < n_k$ for some
k and $m_i \le n_i$ for i > k.

Clearly the rules c↓-c↑, i↓-c↑ and w↓-c↑ strictly reduce the D(ac↑, ·) value of a flow,
while the other rules of norm (as well as w↓-c↑) strictly reduce a flow’s size, while not
increasing the D(ac↑, ·) value. Therefore each application of a norm rule strictly reduces the
lexicographical product D(ac↑, ·) × | · |.

Since norm is terminating, it suffices to check local confluence for the critical pairs,
which are the following by inspection:

[Figure: diagrams of the critical pairs.]


Note that every other overlapping pair can be deformed so that each rule application trivially 
commutes. We consider each case below. 



(6) This case is dual to (1). 

(7) This case is dual to (2). 

(8) This case is dual to (3). 



The cases where we appeal to duality follow by simply flipping the indicated reductions 
upside down and relabeling nodes and reduction steps appropriately. □ 

The significance of the rewriting system norm is evident from the following results,
where we show that reduction steps correspond to sound manipulations of SKS derivations.

Definition 2.11. If R is a relation on flows we say that R lifts polynomially to SKS if,
whenever (fl(Φ), ψ) ∈ R, we can construct a derivation Ψ in time polynomial in |Φ| + |ψ|
with the same premiss and conclusion as Φ and flow ψ. If f is a function on flows then we
say that f lifts polynomially to SKS just if the relation f(·) = · lifts polynomially to SKS.

Sometimes we simply say that an individual flow rewrite rule is sound rather than 
saying that it lifts polynomially to SKS. 

Remark 2.12. In fact a derivation can always be manipulated (preserving premiss, conclusion
and flow) so that it has size at most polynomial in the size of its flow, as (essentially)
shown in [Das13], so the dependence on size of derivation in Dfn. 2.11 is somewhat redundant.
However this is beyond the scope of this work, and it does no harm for us to include
this dependence.


Theorem 2.13. $\to_{\mathsf{norm}}$ lifts polynomially to SKS.


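Proof. See [GG08]. Essentially the proof shows that each local rewrite step on a flow of a
derivation induces a sound manipulation of polynomial size on that derivation. We give as
an example the case for c↓-c↑, since that rule is of particular interest in this work.

Let ξ{ } denote an arbitrary context, i.e. a formula with a single hole occurring in place
of a subformula, and let ξ{A} denote the result of substituting a formula A for the hole in
ξ{ }. The manipulation is as follows:

$$
\begin{array}{c}
\xi\left\{\mathsf{ac}{\downarrow}\,\dfrac{a \vee a}{a}\right\} \\[1ex]
\Phi\,\| \\[1ex]
\zeta\left\{\mathsf{ac}{\uparrow}\,\dfrac{a}{a \wedge a}\right\}
\end{array}
\quad\xrightarrow{\ \mathsf{c}{\downarrow}\text{-}\mathsf{c}{\uparrow}\ }\quad
\begin{array}{c}
\xi\left\{\mathsf{m}\,\dfrac{\mathsf{ac}{\uparrow}\,\dfrac{a}{a \wedge a} \;\vee\; \mathsf{ac}{\uparrow}\,\dfrac{a}{a \wedge a}}{\mathsf{ac}{\downarrow}\,\dfrac{a \vee a}{a} \;\wedge\; \mathsf{ac}{\downarrow}\,\dfrac{a \vee a}{a}}\right\} \\[1ex]
\Phi\{a/(a \wedge a)\}\,\| \\[1ex]
\zeta\{a \wedge a\}
\end{array}
$$

where Φ{a/(a ∧ a)} is obtained by replacing a by (a ∧ a) everywhere in Φ. □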


Corollary 2.14. Given an SKS derivation Φ and a reduction
$\mathit{fl}(\Phi) = \phi_1 \to_{\mathsf{norm}} \cdots \to_{\mathsf{norm}} \phi_n = \mathit{fl}(\Phi){\downarrow}_{\mathsf{norm}}$
we can construct an SKS derivation with the same premiss and conclusion as Φ
and with flow $\mathit{fl}(\Phi){\downarrow}_{\mathsf{norm}}$ in time polynomial in $|\Phi| + \sum_{i=1}^{n} |\phi_i|$.

Proof. By induction on the length n of a norm derivation. □


2.3. Reduction strategies. We analyse the complexity of normalising a flow under norm,
presenting a class of reduction strategies that optimise the size of a $\to_{\mathsf{norm}}$ derivation from a
flow to its normal form, up to a polynomial. Our aim is to prove the following theorem:

Theorem 2.15. The function ${\downarrow}_{\mathsf{norm}}$ mapping a flow to its unique normal form under $\to_{\mathsf{norm}}$
lifts polynomially to SKS.

The result follows from Cor. 2.14 if we can find appropriate reductions with size only
polynomially dependent on the initial derivation and normal form of its flow.

Example 2.16. Consider the following flow,

[Figure: flow diagram.]

where there are n ac↑ nodes. Notice that this flow has normal form [diagram], but the complexity
of a derivation witnessing this can vary significantly. If we apply c↓-c↑ steps first then the
length of a derivation to normal form is $O(2^n)$, whereas applying {w↓-c↓, w↓-c↑} steps first
results in length $O(n)$. These bounds follow from later results in this section, but are not
difficult to prove directly.

In fact, this sort of unnecessary exponential blowup can always be avoided by applying 
‘weakening’ rules first. 

Definition 2.17. We define the following subsystems of norm:

(1) wk = {w↓-c↓, i↓-w↑, c↓-w↑, w↓-c↑, w↓-w↑, c↑-w↑}.

(2) cont = norm \ wk = {c↓-c↑, i↓-c↑}.

Proposition 2.18. Given a flow φ we have:

(1) If $\phi \to_{\mathsf{wk}} \psi$ then |φ| > |ψ|.

(2) If $\phi \to_{\mathsf{cont}} \psi$ then |φ| < |ψ|.

Lemma 2.19. Given a flow φ we have $(\phi{\downarrow}_{\mathsf{wk}}){\downarrow}_{\mathsf{cont}} = \phi{\downarrow}_{\mathsf{norm}}$.

Proof. By inspecting the rules of cont, we observe that if there are no wk-redexes in a flow
ψ and $\psi \to_{\mathsf{cont}} \theta$ then there are no wk-redexes in θ. By induction on the length of a
$\to_{\mathsf{cont}}$ derivation we then have that there are no wk-redexes in $(\phi{\downarrow}_{\mathsf{wk}}){\downarrow}_{\mathsf{cont}}$. But since there are
also no cont-redexes, by definition of normal form, and wk ∪ cont = norm we can conclude
that $(\phi{\downarrow}_{\mathsf{wk}}){\downarrow}_{\mathsf{cont}}$ is already in normal form for norm. □

We can now give a proof of the main theorem of this section. 

Proof of Thm. 2.15. Let Φ be an SKS derivation with flow φ whose normal form under norm
is ψ, and fix a derivation
$\phi = \phi_1 \to_{\mathsf{wk}} \cdots \to_{\mathsf{wk}} \phi_m = \psi_1 \to_{\mathsf{cont}} \cdots \to_{\mathsf{cont}} \psi_n = \psi$,
which must exist by Lemma 2.19.

By Cor. 2.14 we can construct an SKS derivation Ψ with same premiss and conclusion
as Φ and flow ψ in time polynomial in $|\Phi| + \sum_{i=1}^{m} |\phi_i| + \sum_{j=1}^{n} |\psi_j|$. However, by Prop. 2.18
we have that $|\phi_i| \le |\phi_1| = |\phi| \le |\Phi|$ for all i and $|\psi_j| \le |\psi_n| = |\psi|$ for all j, and so this
construction can be done in time polynomial in (m + n)(|Φ| + |ψ|).

Finally, notice also by Prop. 2.18 that each $\to_{\mathsf{wk}}$ step decreases the size of a flow, so m is
bounded above by |φ| (and so also |Φ|), and each cont step increases the size of a flow, so
n is bounded above by |ψ|, whence the result follows. □


3. Complexity of normal forms 

In this section we specialise previous results to KS⁺ flows, reducing the complexity of norm
reduction to counting the number of certain paths in the initial flow. We then give two
simple ways of estimating this number, which we use in later sections to obtain complexity
results.

Proposition 3.1. If φ is the flow of a KS⁺ proof, with normal form ψ under norm, then
ψ is free of aw↑ and ac↑ nodes, i.e. contains just KS nodes.

Proof. We argue by contradiction. Notice that by c↓-c↑ we can assume that all ac↑ nodes
are above all ac↓ nodes in ψ, by deformation, and so a topmost ac↑ node must have upper end
directly connected to an aw↓ or ai↓ node, since φ is associated with a proof. However then ψ
can be reduced by either i↓-c↑ or w↓-c↑, contradicting normality. An aw↑ node, similarly,
must be above all ac↓ and ac↑ nodes by c↓-w↑ and c↑-w↑, and so must have upper end directly
connected to an aw↓ or ai↓ node, and again ψ can be reduced by w↓-w↑ or i↓-w↑,
contradicting normality. □

Notice that the above proposition, along with the results of the previous section, allows
us to transform KS⁺ proofs to KS proofs of the same conclusion. We state this formally
in Thm. 3.8 once we have determined more about the complexity of this transformation,
which we address in the following results.

Definition 3.2. An ai-path is a (simple) path that changes (vertical) direction only at ai↓
and ai↑ nodes. We say that an ai-path is maximal if it cannot be extended, and that it is
open if it begins and ends at a pending end of an edge.

The inversion of a path is just the same path in the reverse direction.

Example 3.3. The paths on the right, and their inversions, are exactly all the maximal
ai-paths of the flow below. All of these are open except 0, since one of its ends is an aw↓
node. More generally all open paths are maximal but not vice-versa.

[Figure: flow diagram.]

23679, 23678,
4578, 4579,
0, 1.


The following results allow us to estimate the size of the normal form of a flow, under 
norm, without actually constructing it. 


Observation 3.4. $\to_{\mathsf{norm}}$ preserves the number of open ai-paths in a flow.

Notation 3.5. We write ⌈φ⌉ to denote the number of open ai-paths in a flow φ, modulo
inversions, and #(ρ, φ) to denote the number of ρ nodes in φ for an atomic structural rule
ρ.

Lemma 3.6. If φ is the flow of a KS proof then ⌈φ⌉ = #(ai↓, φ).

Proof. Since the flow of a proof can have no edge with upper end pending, every edge must
be path-connected to an aw↓ or ai↓ node. Since no open path goes through an aw↓ node,
and there are no ai↑ nodes, every open ai-path goes through a unique ai↓ node, and every
ai↓ node accommodates at least one such path since there are no aw↑ nodes.

Now, the only other node a path can go through is an ac↓ node. Consequently every
node in an open ai-path has in-degree 1 before passing its unique ai↓ node, and out-degree
1 after. Hence each ai↓ node accommodates exactly one open ai-path. □

Lemma 3.7. If φ is the flow of a KS⁺ proof, with normal form ψ under norm, then
|ψ| = O(|φ| + ⌈φ⌉).

Proof. Let $\chi = \phi{\downarrow}_{\mathsf{wk}}$ so that $\phi \to_{\mathsf{wk}}^* \chi \to_{\mathsf{cont}}^* \psi$ by Lemma 2.19. By inspection of the rules of wk
we must be able to decompose χ into two disjoint components: $\chi_1$, consisting of just ai↓,
ac↓ and ac↑ nodes, and $\chi_2$ consisting of just aw↓ and aw↑ nodes. In fact, since φ was the
flow of a proof, $\chi_2$ consists of just aw↓ nodes.

Now notice that any $\to_{\mathsf{cont}}$ derivation from χ to ψ acts only on $\chi_1$, and so ψ can be
decomposed into two disjoint components: $\psi_1$, which is the normal form of $\chi_1$ under norm,
consisting of just ai↓ and ac↓ nodes by Prop. 3.1, and $\psi_2 = \chi_2$, consisting of just aw↓ nodes.
We have that $|\psi_2| = |\chi_2| \le |\chi| \le |\phi|$ by Prop. 2.18.

Notice that $|\psi_1| = 2 \cdot \#(\mathsf{ai}{\downarrow}, \psi_1) + \#(\mathsf{ac}{\downarrow}, \psi_1)$, since each ai↓ node has two edges and
each ac↓ node adds a single extra edge. Since an ac↓ node has in-degree 2 and out-degree 1,
the number of ac↓ nodes cannot outnumber the number of edges emanating from ai↓ nodes
in $\psi_1$, i.e. $\#(\mathsf{ac}{\downarrow}, \psi_1) \le 2 \cdot \#(\mathsf{ai}{\downarrow}, \psi_1)$, so we have $|\psi_1| \le 4 \cdot \#(\mathsf{ai}{\downarrow}, \psi_1)$. By Lemma 3.6 we
then have that $|\psi_1| \le 4 \cdot \lceil\psi_1\rceil$, and by Obs. 3.4 that $|\psi_1| \le 4 \cdot \lceil\phi\rceil$.

Putting these together we obtain $|\psi| = |\psi_1| + |\psi_2| \le |\phi| + 4 \cdot \lceil\phi\rceil$ as required. □

Theorem 3.8. If Φ is a KS⁺ proof with flow φ then we can transform it to a KS proof of
the same conclusion in time polynomial in |Φ| + ⌈φ⌉.

Proof. Let ψ be the normal form of φ under norm. By Prop. 3.1 we have that ψ contains
just KS nodes, and so by Thm. 2.15 we have that a KS proof with same conclusion as Φ
can be constructed in time polynomial in |Φ| + |ψ|. The statement follows by the bound
|ψ| = O(|φ| + ⌈φ⌉) = O(|Φ| + ⌈φ⌉) given by Lemma 3.7. □

3.1. Length of atomic flows. It is not difficult to see that the main contributor to an
increase of flow size reducing under norm is the rule c↓-c↑. It can sometimes cause an
exponential blowup, as evident in Ex. 2.16 if there were no aw↓ node at the top.

The following result provides a simple estimate of the number of paths in a flow, and also 
the complexity of flow normalisation under norm, and is just an adaptation of well-known 
techniques to estimate the number of paths in any directed acyclic graph. 

Definition 3.9 (Dimensions of a flow). The length of a flow is the maximum number of 
times the type of node changes in an ai-path. The width of a flow is the maximum size of 
a connected subflow containing just one type of node. The breadth of a flow is the number 
of connected components it has. 

The above definition is perhaps most easily understood by allowing ac↓ and ac↑ nodes
to have unbounded in-degree and out-degree respectively, for example by just collapsing
any configuration of n − 1 connected ac↓ nodes to a single node, and similarly for
configurations of ac↑ nodes. Then we can consider the length of the flow to be essentially
just the maximum length, in the usual sense, of an ai-path, and the width is the maximum
out-degree or in-degree.

It is worth mentioning here that replacing ac↓ and ac↑ nodes with these ‘super’ nodes
can be lifted polynomially to proofs, in the sense that ac↓ and ac↑ steps in a proof can be
soundly replaced by similar ‘super’ steps with only linear change in size.


Proposition 3.10. If φ is a flow consisting of just KS⁺ nodes with width w, length l and
breadth b, then $\lceil\phi\rceil = b \cdot w^{l/2 + O(1)}$.


Proof. For simplicity we write [diagram] for some configuration of n − 1 connected c↑ nodes, and
similarly for a c↓ configuration. Notice that it suffices to consider the case when b = 1, since
paths in different connected components are disjoint.

In the worst case scenario we just have a sequence of [diagram] configurations in series
vertically, and each configuration multiplies the number of paths by w. It does not make
much difference if there is a [diagram] configuration at the top of the flow because of the
way we have defined an ai-path; the exponent differs by just the addition of a constant. □


Remark 3.11. We generally use the trivial upper bound of size of flow for width and
breadth, yielding the estimate $\lceil\phi\rceil = |\phi|^{l/2 + O(1)}$ for a KS⁺ flow φ.
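
The multiplicative growth of the number of paths can be observed on an ordinary directed acyclic graph. The following sketch is an illustration under our own assumptions, not a computation on flows: the helper `diamond_chain` (a hypothetical name) builds k blocks in series, each fanning out into w parallel branches and merging again, and `count_paths` counts directed paths by the usual memoised recursion.

```python
from functools import lru_cache


def count_paths(succ, source, sink):
    """Number of directed paths from source to sink in a DAG of adjacency lists."""

    @lru_cache(maxsize=None)
    def paths_from(v):
        if v == sink:
            return 1
        # Each path from v continues through exactly one successor of v.
        return sum(paths_from(u) for u in succ.get(v, ()))

    return paths_from(source)


def diamond_chain(w, k):
    """k blocks in series; each block fans out into w branches that merge again."""
    succ = {}
    for i in range(k):
        middles = [("mid", i, j) for j in range(w)]
        succ[("hub", i)] = middles
        for m in middles:
            succ[m] = [("hub", i + 1)]
    return succ, ("hub", 0), ("hub", k)


succ, top, bottom = diamond_chain(3, 4)
total = count_paths(succ, top, bottom)  # 3 * 3 * 3 * 3 paths
```

Each block multiplies the number of paths by w, so the count is exponential in the number of blocks even though the graph itself has size linear in w · k.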


3.2. Contraction loops in atomic flows. Sometimes the estimate of the number of open
ai-paths given via the length of a flow in the previous section is not sufficiently accurate. An
example of this is given in Sect. 6.2, where the length of flows in the translation R is
polynomial in their size, and so the estimate is exponential, yet the actual number of open
ai-paths is bounded above by a polynomial.

Unsurprisingly, it is only certain interactions between ac↓ and ac↑ nodes that generate
significant complexity in a KS⁺ flow; in this section we focus on one type of interaction we
call a ‘contraction loop’. In the absence of these the number of open ai-paths in a flow is
still polynomial in its size, regardless of its length.

Definition 3.12. A contraction loop in a flow is a pair of (ac↑, ac↓) nodes $(\nu_1, \nu_2)$ such that
there are at least two disjoint (directed) paths between $\nu_1$ and $\nu_2$.
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For example we give the following flow and all its contraction loops, 


{u,y), 0 ,z). 


whereas every other pair has only one path between them. If the edge ★ were broken and 
there was no path from w to x then there would be no contraction loops at all. 
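Definition 3.12 can be checked mechanically on small flows. The following is a minimal sketch, not taken from the paper: it assumes a flow is given as a list of downward-directed edges between named nodes (a repeated pair stands for parallel edges), that the hypothetical dictionary `kind` labels each node, and that "disjoint" may be read as edge-disjoint. The number of disjoint paths is computed as a unit-capacity maximum flow.

```python
from collections import defaultdict


def edge_disjoint_paths(edges, src, dst):
    """Maximum number of pairwise edge-disjoint directed paths from src to dst.

    Computed as a unit-capacity max flow using depth-first augmenting paths.
    """
    if src == dst:
        return 0
    cap = defaultdict(int)   # residual capacities
    adj = defaultdict(set)   # residual adjacency (both directions)
    for u, v in edges:
        cap[(u, v)] += 1
        adj[u].add(v)
        adj[v].add(u)

    def augment():
        parent = {src: None}
        stack = [src]
        while stack and dst not in parent:
            x = stack.pop()
            for y in adj[x]:
                if y not in parent and cap[(x, y)] > 0:
                    parent[y] = x
                    stack.append(y)
        if dst not in parent:
            return False
        y = dst
        while parent[y] is not None:   # push one unit of flow back along the path
            x = parent[y]
            cap[(x, y)] -= 1
            cap[(y, x)] += 1
            y = x
        return True

    count = 0
    while augment():
        count += 1
    return count


def contraction_loops(kind, edges):
    """All pairs (u, v), u an ac-up node and v an ac-down node, joined by >= 2 disjoint paths."""
    ups = [x for x, k in kind.items() if k == "ac_up"]
    downs = [x for x, k in kind.items() if k == "ac_down"]
    return sorted((u, v) for u in ups for v in downs
                  if edge_disjoint_paths(edges, u, v) >= 2)


# Hypothetical flow with one loop: the ac-up node u feeds both inputs of the ac-down node v.
loop_kind = {"i": "ai_down", "u": "ac_up", "v": "ac_down", "b1": "pending", "b2": "pending"}
loop_edges = [("i", "u"), ("i", "b1"), ("u", "v"), ("u", "v"), ("v", "b2")]

# Hypothetical flow without loops: only one output of u reaches v.
free_kind = {"w": "aw_down", "z": "aw_down", "u": "ac_up", "v": "ac_down",
             "b1": "pending", "b2": "pending"}
free_edges = [("w", "u"), ("u", "v"), ("u", "b1"), ("z", "v"), ("v", "b2")]
```

Since every node of a KS⁺ flow has in-degree or out-degree at most 1, two directed paths that share an intermediate node also share an edge, so the edge-disjoint reading used here should agree with a vertex-disjoint one on genuine flows.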

Lemma 3.13. If there are no contraction loops in a KS⁺ flow φ then the number of open
ai-paths in φ is at most $|\phi|^3$.

Proof. For an edge e in φ consider the following two notions:

• The weight of e, denoted $w(e)$, is the number of directed paths from e to the bottom of
φ, i.e. to an aw↑ node or an edge with lower end pending.

• For an atomic structural rule ρ let $N(\rho, e)$ denote the number of ρ nodes below e that are
connected to e by a directed path.

We show that $w(e) \le N(\mathsf{aw}{\uparrow}, e) + N(\mathsf{ac}{\uparrow}, e) + 1$ by induction on the distance of e from the
bottom of φ. The inequality is clear for the base cases, when e is directly connected to an
aw↑ node and when e has lower end pending. We have two inductive steps:

(1) e is an upper edge of an ac↓ node with lower edge δ. In this case we clearly have that
$w(e) = w(\delta)$, and so the inequality follows by the inductive hypothesis.

(2) e is the upper edge of an ac↑ node with lower edges γ and δ. Observe that, since there
are no contraction loops in φ and ac↓ is the only node type with in-degree greater than 1,
any node below this ac↑ node can be directed-path-connected to at most one of γ or δ.
Consequently we have that $N(\mathsf{aw}{\uparrow}, e) = N(\mathsf{aw}{\uparrow}, \gamma) + N(\mathsf{aw}{\uparrow}, \delta)$ and
$N(\mathsf{ac}{\uparrow}, e) = N(\mathsf{ac}{\uparrow}, \delta) + N(\mathsf{ac}{\uparrow}, \gamma) + 1$. Therefore,

$$
\begin{array}{rcl}
w(e) & = & w(\delta) + w(\gamma) \\
 & \le & \big(N(\mathsf{aw}{\uparrow}, \delta) + N(\mathsf{ac}{\uparrow}, \delta) + 1\big) + \big(N(\mathsf{aw}{\uparrow}, \gamma) + N(\mathsf{ac}{\uparrow}, \gamma) + 1\big) \\
 & \le & \big(N(\mathsf{aw}{\uparrow}, \delta) + N(\mathsf{aw}{\uparrow}, \gamma)\big) + \big(N(\mathsf{ac}{\uparrow}, \delta) + N(\mathsf{ac}{\uparrow}, \gamma) + 1\big) + 1 \\
 & \le & N(\mathsf{aw}{\uparrow}, e) + N(\mathsf{ac}{\uparrow}, e) + 1
\end{array}
$$

Finally, notice that $|\phi| \ge N(\mathsf{aw}{\uparrow}, \phi) + N(\mathsf{ac}{\uparrow}, \phi) + 1$ and so we have shown that $w(e) \le |\phi|$.

Clearly the number of open ai-paths going through an edge with upper end pending is
bounded above by its weight, and so by $|\phi|$ by the bound above, while the number of open
ai-paths going through any ai↓ node is bounded above by the product of the weights of each
of its edges, and so by $|\phi|^2$. In particular we have that the number of open ai-paths going
through any edge at the top of a flow is bounded above by $|\phi|^2$, and there are at most $|\phi|$
many such edges, whence the bound follows. □
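The two quantities used in this proof are easy to compute on a concrete flow. The sketch below is illustrative only: the edge-list encoding, the `kind` labels and the function names are assumptions, not notation from the paper. It evaluates the weight $w(e)$ and the counts $N(\rho, e)$ for an edge of a small flow, so that the inequality $w(e) \le N(\mathsf{aw}{\uparrow}, e) + N(\mathsf{ac}{\uparrow}, e) + 1$ can be checked by hand.

```python
from functools import lru_cache

BOTTOM = ("aw_up", "pending")   # node kinds at which a downward path ends


def _children(edges):
    out = {}
    for u, v in edges:
        out.setdefault(u, []).append(v)
    return out


def weight(kind, edges, edge):
    """w(e): number of directed paths from the edge `edge` down to the bottom of the flow."""
    out = _children(edges)

    @lru_cache(maxsize=None)
    def paths(node):
        if kind[node] in BOTTOM:
            return 1
        return sum(paths(child) for child in out.get(node, ()))

    return paths(edge[1])


def nodes_below(kind, edges, edge, rule):
    """N(rule, e): number of `rule` nodes reachable by a directed path from `edge`."""
    out = _children(edges)
    seen, stack = set(), [edge[1]]
    while stack:
        x = stack.pop()
        if x not in seen:
            seen.add(x)
            stack.extend(out.get(x, ()))
    return sum(1 for x in seen if kind[x] == rule)


# Hypothetical loop-free flow: two stacked ac-up nodes, one output ending in an aw-up node.
kind = {"t": "aw_down", "u1": "ac_up", "u2": "ac_up",
        "p1": "pending", "p2": "pending", "p3": "aw_up"}
edges = [("t", "u1"), ("u1", "u2"), ("u1", "p1"), ("u2", "p2"), ("u2", "p3")]
e = ("t", "u1")
bound = nodes_below(kind, edges, e, "aw_up") + nodes_below(kind, edges, e, "ac_up") + 1
```

Here `weight(kind, edges, e)` counts the three downward paths from `e`, while `bound` evaluates the right-hand side of the inequality for the same edge.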
Example 3.14. In fact the bound given above is optimal, up to multiplication by a constant.
Consider the flow φ below, where there are n ai↓ nodes:

[Figure: the flow φ.]

Clearly the flow has size linear in n and notice that the weights (as defined in the above
proof) of the topmost edges, starting from the left, are $1, 2, \dots, n, n, \dots, 1$ respectively.
Consequently the number of open ai-paths going through the ai↓ nodes, starting from the
outside, is $1^2, 2^2, \dots, n^2$ respectively, and so taking the sum we obtain $\Theta(n^3)$ open ai-paths.

Notice also that φ has length linear in n and width 2, and so the upper bound on the number
of open ai-paths given by Prop. 3.10 is exponential, considerably worse than the bound given
in Lemma 3.13.
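The count in this example can be made explicit. The following worked sum is a sketch that only uses the weights listed above and the stated fact that the size of the flow is linear in n:

```latex
\sum_{i=1}^{n} i^2 \;=\; \frac{n(n+1)(2n+1)}{6} \;=\; \Theta(n^3),
\qquad\text{and since } |\phi| = \Theta(n), \qquad
\sum_{i=1}^{n} i^2 \;=\; \Theta\big(|\phi|^3\big).
```

So the number of open ai-paths matches the cubic bound of Lemma 3.13 up to a constant factor.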


4. Truth tables and tree-like Gentzen systems 

KS polynomially simulates tree-like cut-free Gentzen systems since its rules are generalisations
of Gentzen rules [BT01] [BG09]. In the other direction Bruscoli and Guglielmi have
proved in [BG09] that the converse does not hold, by way of the so-called ‘Statman
tautologies’. We offer a new proof here, via truth tables, as an exercise prior to the main
results.

Let LK⁻ denote the cut-free sequent calculus, and tree-LK⁻ denote the system of
tree-like proofs in this calculus.³

Proposition 4.1 (D’Agostino). Tree-LK⁻ cannot polynomially simulate truth tables.

Proof. See [D’A92]. □

To expand slightly on the above proposition, truth tables are efficient when there are
exponentially many occurrences of each atom, and some such tautologies are hard for
tree-LK⁻. One such example, used by D’Agostino, is simply the disjunction of every assignment
on k propositional variables, what we call $\bigvee_{\mathcal{A}} \gamma_{\mathcal{A}}$ below. Such a formula has size $k \cdot 2^k$,
although any tree-like cut-free sequent proof must contain at least $k!$ branches, while a
truth table contains $2^k$ rows and $k \cdot 2^k$ columns.
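To get a feel for these quantities, the sketch below tabulates them for a given k. It is purely illustrative: the function names are made up here, and comparing $k!$ with the formula size $k \cdot 2^k$ only indicates where the branch count overtakes the size of the formula, not the precise separation.

```python
from math import factorial


def truth_table_vs_tree(k):
    """Sizes attached to the disjunction of all assignments on k variables."""
    return {
        "formula_size": k * 2**k,               # size of the formula, and columns of the table
        "table_rows": 2**k,                     # one row per assignment
        "tree_branches_at_least": factorial(k)  # lower bound on branches of a tree-LK- proof
    }


def first_k_where_branches_exceed_formula_size(limit=64):
    """Smallest k (up to `limit`) with k! > k * 2^k, or None if there is none."""
    for k in range(1, limit + 1):
        sizes = truth_table_vs_tree(k)
        if sizes["tree_branches_at_least"] > sizes["formula_size"]:
            return k
    return None
```

For instance, `truth_table_vs_tree(5)` reports 160 for the formula size against 120 branches, and the branch count is the larger of the two from k = 6 onwards.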

Lemma 4.2. KS⁺ polynomially simulates truth tables.

³ It does not matter too much which version of the sequent calculus we choose, as complexity
properties are typically preserved across common variations.
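Proof. Let τ be a tautology. For each partial assignment $\mathcal{A}$, defined on just those variables
appearing in τ, and each formula A satisfied by $\mathcal{A}$, construct a derivation $\Phi_{\mathcal{A}}(A)$ by structural
induction on A as follows:

$$
\Phi_{\mathcal{A}}(a) = a, \qquad
\Phi_{\mathcal{A}}(B \wedge C) = \Phi_{\mathcal{A}}(B) \wedge \Phi_{\mathcal{A}}(C), \qquad
\Phi_{\mathcal{A}}(B \vee C) = \dfrac{\Phi_{\mathcal{A}}(B)}{B \vee \mathsf{w}{\downarrow}\dfrac{\bot}{C}}
$$

where, in the last case, when A is a disjunction, the disjunct B was chosen such that B is
satisfied by $\mathcal{A}$. It is clear that each $\Phi_{\mathcal{A}}(\tau)$ has conclusion τ and premiss a conjunction of
literals; moreover this conjunction of literals is satisfied by $\mathcal{A}$.

Let $\gamma_{\mathcal{A}}$ be the conjunction of all literals satisfied by $\mathcal{A}$, so that each variable appears
exactly once. Then we can easily construct derivations from $\gamma_{\mathcal{A}}$ to $\mathrm{pr}(\Phi_{\mathcal{A}}(\tau))$, the premiss
of $\Phi_{\mathcal{A}}(\tau)$, in {aw↑, ac↑}.

Now construct a proof Ψ of $\bigvee_{\mathcal{A}} \gamma_{\mathcal{A}}$ in {ai↓, ac↑, s, m} by induction on the number of
distinct variables, as shown below.

Base case: $\mathsf{ai}{\downarrow}\dfrac{\top}{a \vee \bar a}$.

Inductive step: given such a proof Ψ′ of $\bigvee_{\mathcal{A}} \gamma_{\mathcal{A}}$ for one fewer variable, each disjunct $\gamma_{\mathcal{A}}$ of
its conclusion is extended by the new variable a via the derivation

$$
2 \cdot \mathsf{s}\,
\dfrac{\mathsf{ai}{\downarrow}\dfrac{\top}{a \vee \bar a} \;\wedge\; \mathsf{c}{\uparrow}\dfrac{\gamma_{\mathcal{A}}}{\gamma_{\mathcal{A}} \wedge \gamma_{\mathcal{A}}}}
      {(a \wedge \gamma_{\mathcal{A}}) \vee (\bar a \wedge \gamma_{\mathcal{A}})}
$$

so that the conclusion of the resulting proof is $\bigvee_{\mathcal{A}} (a \wedge \gamma_{\mathcal{A}}) \vee \bigvee_{\mathcal{A}} (\bar a \wedge \gamma_{\mathcal{A}})$.

Finally we put these together and apply contractions to obtain a KS⁺ proof of τ (derivations
are written horizontally here, from premiss on the left to conclusion on the right):

$$
\top \;\xrightarrow{\ \Psi\ }\;
\bigvee_{\mathcal{A}} \left( \gamma_{\mathcal{A}} \;\xrightarrow{\ \{\mathsf{aw}{\uparrow},\, \mathsf{ac}{\uparrow}\}\ }\; \mathrm{pr}(\Phi_{\mathcal{A}}(\tau)) \;\xrightarrow{\ \Phi_{\mathcal{A}}(\tau),\ \{\mathsf{w}{\downarrow}\}\ }\; \tau \right)
\;\xrightarrow{\ \{\mathsf{c}{\downarrow}\}\ }\; \tau
$$

It is clear that the derivations inside the large parentheses have size polynomial in $|\tau|$, which
is the number of columns in a truth table, and the number of these derivations appearing
in disjunction is just the number of assignments, which is the number of rows in a truth
table. □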
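The structural induction at the start of this proof can be sketched in code. The following is an illustration under assumptions that are not in the paper: formulae are nested tuples `("lit", variable, polarity)`, `("and", B, C)` and `("or", B, C)`, and an assignment is a dictionary from variables to booleans. It computes the literals occurring in the premiss of the derivation built for a satisfied formula: conjunctions keep both sides, and a disjunction keeps only one satisfied disjunct, the other being introduced by weakening.

```python
def satisfied(formula, assignment):
    """Evaluate a formula given as nested tuples under a total assignment."""
    tag = formula[0]
    if tag == "lit":
        _, var, positive = formula
        return assignment[var] == positive
    if tag == "and":
        return satisfied(formula[1], assignment) and satisfied(formula[2], assignment)
    if tag == "or":
        return satisfied(formula[1], assignment) or satisfied(formula[2], assignment)
    raise ValueError(f"unknown connective: {tag}")


def premiss_literals(formula, assignment):
    """Literals in the premiss of the derivation for `formula`, which must be satisfied."""
    tag = formula[0]
    if tag == "lit":
        return [(formula[1], formula[2])]
    if tag == "and":
        return (premiss_literals(formula[1], assignment)
                + premiss_literals(formula[2], assignment))
    if tag == "or":
        # keep one satisfied disjunct; the other disjunct is weakened in
        chosen = formula[1] if satisfied(formula[1], assignment) else formula[2]
        return premiss_literals(chosen, assignment)
    raise ValueError(f"unknown connective: {tag}")


# Hypothetical tautology: (a AND b) OR (NOT a OR NOT b)
a, b = ("lit", "a", True), ("lit", "b", True)
na, nb = ("lit", "a", False), ("lit", "b", False)
tau = ("or", ("and", a, b), ("or", na, nb))
```

Every literal returned is satisfied by the assignment, which is the property that lets the premiss be reached from the full conjunction of satisfied literals using only aw↑ and ac↑ steps.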


Theorem 4.3. KS polynomially simulates truth tables.

Proof. Notice that, in the above simulation, all ac↑ steps are above all ac↓ steps, and so the
associated flow will have bounded length. The result follows by Cor. 2.14, Lemma 3.7 and
Prop. 3.10. □

Corollary 4.4. Tree-LK⁻ cannot polynomially simulate KS.

Proof. Immediate from Prop. 4.1 and Thm. 4.3. □
Remark 4.5. It should be noted that D’Agostino’s separation is only quasipolynomial⁴ and
so our separation is also only quasipolynomial, while the proof using the Statman tautologies
in [BG09] yields an exponential separation. Nonetheless, an exponential separation follows
from the results in the next section.
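The quasipolynomial nature of the separation can be seen with a short calculation. The following sketch uses the crude bounds $(k/e)^k \le k! \le k^k$ in place of an integral estimate, and writes $N = 2^k$, so that $k = \log_2 N$:

```latex
k! \;\le\; k^k \;=\; 2^{k \log_2 k} \;=\; N^{\log_2 \log_2 N},
\qquad
k! \;\ge\; \Big(\frac{k}{e}\Big)^{k} \;=\; 2^{k(\log_2 k - \log_2 e)} \;=\; N^{\log_2 \log_2 N - O(1)} .
```

Hence $k! = N^{\Theta(\log \log N)}$, which is superpolynomial but only quasipolynomial in $N = 2^k$.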


5. Separations via variants of the pigeonhole principle 

Jeřábek has shown that KS⁺ has polynomial-size proofs of the functional and onto variants
of the pigeonhole principle [Jer09]. We show that these proofs reduce under norm to KS
proofs with only polynomial increase in size, and so cut-free sequent calculi, Resolution and
bounded-depth Frege systems are unable to polynomially simulate KS.

The pigeonhole principle states that, if n pigeons sit in n − 1 holes, some hole must
contain more than one pigeon. In its unrestricted formulation, the mapping from pigeons
to holes can be many-many, while in the functional variant it must be many-one, i.e. a
function, and the onto variant insists that each hole must be occupied. It is clear that
the two variants are weaker than the unrestricted version, and the variant containing both
criteria, the onto functional pigeonhole principle, is weaker still. We will see this more
clearly in the following definition.

The propositional encodings of pigeonhole principle variants below are most easily
understood by interpreting the atoms $a_{ij}$ as “pigeon i sits in hole j”. Recall that it does not
matter how large disjunctions and conjunctions are bracketed, since any valid bracketing
can be easily reduced to any other by the = rule.

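Definition 5.1 (Pigeonhole principles). We define the following formulae,

$$
\mathrm{PHP}_n = \bigvee_{i=0}^{n} \bigwedge_{j=1}^{n} \bar a_{ij} \;\vee\; \bigvee_{j=1}^{n} \bigvee_{i<i'} (a_{ij} \wedge a_{i'j}),
\qquad
F_n = \bigvee_{i=0}^{n} \bigvee_{j<j'} (a_{ij} \wedge a_{ij'}),
\qquad
O_n = \bigvee_{j=1}^{n} \bigwedge_{i=0}^{n} \bar a_{ij}
$$

and denote by $\mathrm{FPHP}_n$, $\mathrm{OPHP}_n$ and $\mathrm{OFPHP}_n$ the formulae obtained by putting in disjunction
the associated formulae, i.e. $\mathrm{FPHP}_n = F_n \vee \mathrm{PHP}_n$, $\mathrm{OPHP}_n = O_n \vee \mathrm{PHP}_n$ and
$\mathrm{OFPHP}_n = O_n \vee F_n \vee \mathrm{PHP}_n$.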
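For small n these formulae can be checked by brute force. The sketch below is an illustration, not part of the paper: it assumes pigeons are indexed 0, ..., n and holes 1, ..., n, following the index ranges in the definition, and represents an assignment as a dictionary from pairs (i, j) to booleans. All names in it are hypothetical.

```python
from itertools import combinations, product


def php_variants(n, val):
    """Truth values of the pigeonhole formulae under `val`.

    val[i, j] is True iff pigeon i sits in hole j (pigeons 0..n, holes 1..n).
    """
    pigeons, holes = range(0, n + 1), range(1, n + 1)
    php = (any(all(not val[i, j] for j in holes) for i in pigeons)       # some pigeon is homeless
           or any(val[i, j] and val[k, j]                                # or some hole is shared
                  for j in holes for i, k in combinations(pigeons, 2)))
    f = any(val[i, j] and val[i, k]                                      # some pigeon in two holes
            for i in pigeons for j, k in combinations(holes, 2))
    o = any(all(not val[i, j] for i in pigeons) for j in holes)          # some hole is empty
    return {"PHP": php, "F": f, "O": o,
            "FPHP": f or php, "OPHP": o or php, "OFPHP": o or f or php}


def is_tautology(n, name):
    """Check by exhaustive search whether the named formula holds under every assignment."""
    atoms = [(i, j) for i in range(0, n + 1) for j in range(1, n + 1)]
    return all(php_variants(n, dict(zip(atoms, bits)))[name]
               for bits in product([False, True], repeat=len(atoms)))
```

The search ranges over $2^{n(n+1)}$ assignments, so it is only practical for very small n. It confirms, for example, that $\mathrm{PHP}_2$ and its weaker variants are tautologies while $F_2$ and $O_2$ on their own are not.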

We can see in the above definition that any variant can be obtained from a stronger
variant, i.e. one with a subset of disjuncts, by a simple application of generic weakening w↓.
Consequently upper bounds on the size of proofs of one variant yield upper bounds for all
weaker variants, and lower bounds vice-versa.

The following result was proved by Beame, Impagliazzo and Pitassi, and independently
by Krajíček, Pudlák and Woods.

Theorem 5.2. Bounded-depth Frege systems have only exponential-size proofs of $\mathrm{OFPHP}_n$.

Proof. See [KPW95], [PBI93]. □

Corollary 5.3. Bounded-depth Frege systems, Resolution and cut-free sequent calculi have
only exponential-size proofs of all variants of the pigeonhole principle.

Proof. All the systems are just special cases of bounded-depth Frege, and a proof of any
variant can be extended to one of $\mathrm{OFPHP}_n$ by an application of generic weakening w↓. □

⁴ A quasipolynomial is a function of size $n^{\log^{\Theta(1)} n}$. By taking logarithms and using the integral bound it
is not difficult to see that $k!$ grows quasipolynomially in $2^k$.
On the other hand we have the following:

Theorem 5.4 (Buss). There are polynomial-size Frege proofs of $\mathrm{PHP}_n$, and so all variants
of the pigeonhole principle.

Proof. See [Bus87]. □

From here, polynomial-size SKS proofs of $\mathrm{PHP}_n$ follow from the following observation:

Proposition 5.5 (Bruscoli and Guglielmi). SKS is polynomially equivalent to Frege systems.

Proof. See [BG09]. □

Notice that one direction of the above proposition, indeed the direction that we require,
that SKS polynomially simulates Frege systems, can be obtained by recognising that the
rules of SKS are just generalisations of the rules of Gentzen systems with cut, which are
well-known to be polynomially equivalent to Frege systems.

The following trick, now standard in the deep inference literature, is very useful for
proving certain tautologies in KS, as we will see. The idea is that if we know there is
an SKS proof of a tautology, then we can transform it into a KS proof of that tautology
in disjunction with trivial contradictions. If we can find derivations from each of these
contradictions to the tautology we want to prove, then the composition mechanisms for
derivations in deep inference guarantee that we can build a proof of the tautology.

Lemma 5.6. Let A be a formula over the atoms $a_1, \dots, a_n$. Every SKS proof Φ of A can
be polynomially transformed to a KS proof of $A \vee \bigvee_{i} (a_i \wedge \bar a_i)$.

Proof. See e.g. [Jer09], [BGGP10]. □

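Lemma 5.7 (Jeřábek). There are polynomial-size proofs of $\mathrm{FPHP}_n$ and $\mathrm{OPHP}_n$ in KS⁺.

Proof. By Thm. 5.4, Prop. 5.5 and Lemma 5.6 there are polynomial-size KS proofs with
conclusion $\mathrm{PHP}_n \vee \bigvee_{i,j} (a_{ij} \wedge \bar a_{ij})$. For each atom $a_{st}$ we construct a derivation $\Phi_{st}$ in
KS⁺ \ {ac↓} with premiss $a_{st} \wedge \bar a_{st}$ and conclusion $\mathrm{FPHP}_n$ as follows, where each formula
follows from the one above it by the rules indicated on the left:

$$
\begin{array}{ll}
 & a_{st} \wedge \bar a_{st} \\
=,\ \mathsf{i}{\downarrow} & a_{st} \wedge \bar a_{st} \wedge \Big( \bigwedge_{j \neq t} \bar a_{sj} \vee \bigvee_{j \neq t} a_{sj} \Big) \\
2 \cdot \mathsf{s} & \bigwedge_{j} \bar a_{sj} \;\vee\; \Big( a_{st} \wedge \bigvee_{j \neq t} a_{sj} \Big) \\
(n-2) \cdot \mathsf{ac}{\uparrow} & \bigwedge_{j} \bar a_{sj} \;\vee\; \Big( (a_{st} \wedge \cdots \wedge a_{st}) \wedge \bigvee_{j \neq t} a_{sj} \Big) \\
\{\mathsf{s}\} & \bigwedge_{j} \bar a_{sj} \;\vee\; \bigvee_{j \neq t} (a_{st} \wedge a_{sj}) \\
\mathsf{w}{\downarrow} & \mathrm{FPHP}_n
\end{array}
$$

We then put these together and apply contractions to obtain proofs of $\mathrm{FPHP}_n$ (derivations
are written horizontally here, from premiss on the left to conclusion on the right):

$$
\top \;\xrightarrow{\ \mathsf{KS}\ }\;
\mathrm{PHP}_n \vee \bigvee_{s,t} \left( a_{st} \wedge \bar a_{st} \;\xrightarrow{\ \Phi_{st},\ \mathsf{KS}^+ \setminus \{\mathsf{ac}{\downarrow}\}\ }\; \mathrm{FPHP}_n \right)
\;\xrightarrow{\ \{\mathsf{c}{\downarrow}\}\ }\; \mathrm{FPHP}_n
$$

We can construct similar derivations from $a_{st} \wedge \bar a_{st}$ to $\mathrm{OPHP}_n$, given below, and put them
together in a similar way to obtain proofs of $\mathrm{OPHP}_n$ in KS⁺.

$$
\begin{array}{ll}
 & a_{st} \wedge \bar a_{st} \\
=,\ \mathsf{i}{\downarrow} & a_{st} \wedge \bar a_{st} \wedge \Big( \bigwedge_{i \neq s} \bar a_{it} \vee \bigvee_{i \neq s} a_{it} \Big) \\
2 \cdot \mathsf{s} & \bigwedge_{i} \bar a_{it} \;\vee\; \Big( a_{st} \wedge \bigvee_{i \neq s} a_{it} \Big) \\
(n-2) \cdot \mathsf{ac}{\uparrow} & \bigwedge_{i} \bar a_{it} \;\vee\; \Big( (a_{st} \wedge \cdots \wedge a_{st}) \wedge \bigvee_{i \neq s} a_{it} \Big) \\
\{\mathsf{s}\} & \bigwedge_{i} \bar a_{it} \;\vee\; \bigvee_{i \neq s} (a_{st} \wedge a_{it}) \\
\mathsf{w}{\downarrow} & \mathrm{OPHP}_n
\end{array}
$$

□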


Theorem 5.8. There are polynomial-size KS proofs of $\mathrm{FPHP}_n$, $\mathrm{OPHP}_n$ and so also $\mathrm{OFPHP}_n$.

Proof. In the proofs of $\mathrm{FPHP}_n$ constructed above, in Lemma 5.7, notice that the only ac↑ steps
occur in the derivations $\Phi_{st}$, where there are also no ac↓ steps, and similarly for $\mathrm{OPHP}_n$. It
follows that there are only two alternations between ac↓ and ac↑ steps in the path of any atom
from an ai↓ step, and so the atomic flows of these proofs will have bounded length. The result
follows by Thm. 3.8 and Prop. 3.10. □

Corollary 5.9. Cut-free sequent calculi, Resolution and bounded-depth Frege systems are
exponentially separated from KS.

Proof. Immediate from Cor. 5.3 and Thm. 5.8. □


6. Polynomial simulations of versions of Resolution 

In this section we present a polynomial simulation in KS of tree-like and multiset Resolution
systems. We point out that, in previous work [Das12b], a more general result was claimed,
namely a simulation of unrestricted Resolution operating with sets; however, that proof
contained errors that we could not amend for this version of the article. The status of those
results is unresolved.

We define the Resolution system below in both its set and multiset formulations. 



















Definition 6.1 (Resolution). We use symbols Γ, Δ etc. to range over (multi)sets of literals
and write ‘∪’ to denote (multi)set union. We define the system (multiset-)Resolution by
the following rules:

$$
\mathsf{wk}\,\dfrac{\Gamma}{\Gamma \cup \Delta}
\qquad\qquad
\mathsf{res}\,\dfrac{\Gamma \cup \{a\} \qquad \Delta \cup \{\bar a\}}{\Gamma \cup \Delta}
$$

A derivation from (multi)sets $\Gamma_1, \dots, \Gamma_s$ is a list⁵ $\pi = (\Delta_1, \dots, \Delta_n)$ where each $\Delta_i$ is some
$\Gamma_j$ or follows by one of the rules above whose premisses have occurred previously in the list.
If we further have that $\Delta_n = \emptyset$ then we call π a refutation of $\Gamma_1, \dots, \Gamma_s$.

We call a derivation $(\Delta_1, \dots, \Delta_n)$ tree-like if each $\Delta_i$ is used at most once as the premiss
of any inference step concluding some $\Delta_j$.
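The multiset formulation of this system is straightforward to prototype. The sketch below is an illustration under assumed conventions that are not in the paper: literals are strings with a leading `~` for duals, clauses are `collections.Counter` multisets, and a derivation is a list of steps that refer to earlier list positions by index.

```python
from collections import Counter


def neg(lit):
    """Dual of a literal; 'a' <-> '~a'."""
    return lit[1:] if lit.startswith("~") else "~" + lit


def wk(clause, extra):
    """wk: from a clause infer its multiset union with `extra`."""
    return clause + extra


def res(left, right, atom):
    """res: from left = G + {a} and right = D + {~a} infer the multiset union G + D."""
    if left[atom] == 0 or right[neg(atom)] == 0:
        raise ValueError("resolved literal missing from a premiss")
    return (left - Counter([atom])) + (right - Counter([neg(atom)]))


def run(steps):
    """Compute the list of clauses of a derivation given as a list of steps."""
    clauses = []
    for step in steps:
        if step[0] == "axiom":                      # ("axiom", literals)
            clauses.append(Counter(step[1]))
        elif step[0] == "wk":                       # ("wk", premiss index, literals)
            clauses.append(wk(clauses[step[1]], Counter(step[2])))
        elif step[0] == "res":                      # ("res", left index, right index, atom)
            clauses.append(res(clauses[step[1]], clauses[step[2]], step[3]))
        else:
            raise ValueError(f"unknown step: {step[0]}")
    return clauses


def is_tree_like(steps):
    """True iff every list position is used at most once as a premiss."""
    uses = Counter()
    for step in steps:
        if step[0] == "wk":
            uses[step[1]] += 1
        elif step[0] == "res":
            uses[step[1]] += 1
            uses[step[2]] += 1
    return all(count <= 1 for count in uses.values())


# Hypothetical tree-like refutation of {a}, {~a, b}, {~b}.
refutation = [("axiom", ["a"]), ("axiom", ["~a", "b"]), ("res", 0, 1, "a"),
              ("axiom", ["~b"]), ("res", 2, 3, "b")]
# Hypothetical derivation that uses position 0 twice, so it is not tree-like.
shared = [("axiom", ["a"]), ("axiom", ["~a"]), ("res", 0, 1, "a"), ("wk", 0, ["b"])]
```

In this encoding an initial clause may be listed several times, each occurrence being a separate position, which is how a refutation can reuse an assumption and still count as tree-like.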


To simplify the treatment of (multiset-)Resolution derivations we address certain rather
pathological situations below. These are also the reason why we opt to include weakening
in our formulation.⁶

Remark 6.2 (Assumptions on derivation format). In a (multiset-)Resolution derivation,
we can assume that neither premiss of a res step contains both a resolved atom and its dual.
Any such step would have the following format,

$$
\mathsf{res}\,\dfrac{\Gamma \cup \{a\} \qquad \Delta \cup \{a, \bar a\}}{\Gamma \cup \Delta \cup \{a\}}
$$

which can be simulated by an application of wk to the left premiss, if necessary at all.

We consequently have that no res step has identical premisses, and so there is no need
to consider this case in the translations that follow.

Finally, in the case of sets, we further assume that neither the resolved atom nor its
dual appear in the conclusion of a res step. I.e., in the definition of res above, we assume
that Γ and Δ contain neither $a$ nor $\bar a$. Aside from the above situation this phenomenon
might also occur if, say, $a \in \Gamma$ and so $\Gamma = \Gamma \cup \{a\}$, whence the step may be simulated by
an application of wk in a similar way.

Before presenting our simulations in KS of versions of Resolution, we set some notational 
conventions below in order to easily switch between the two settings. 

Notation 6.3 (Set symbols in deep inference). To reduce the amount of syntax in our
deep inference derivations, we will simply write Γ instead of $\bigvee \Gamma$ for the disjunction of the
members of Γ, as an abuse of notation.

We similarly use other (multi)set-theoretic notation. In particular, in light of the above
remark, we will always have that $\Gamma \cup \{a\} = \Gamma \vee a$ in both the set and multiset settings
whenever it occurs. In the multiset setting we further have that $\Gamma \cup \Delta = \Gamma \vee \Delta$ for all Γ, Δ.

Remark 6.4. Throughout this section, when we say that a proof system polynomially
simulates a refutation system, we mean that every refutation of A in the latter can be
polynomially transformed to a proof of $\bar A$ in the former.

⁵ We do not mind how the list is delimited. In the translations below they are written vertically for
presentation reasons.

⁶ It is straightforward to show that the formulation without weakening is polynomially equivalent by a
rule permutation argument.
Definition 6.5 (Dual systems). The dual of a deep inference rule $\rho\,\dfrac{A}{B}$ is $\bar\rho\,\dfrac{\bar B}{\bar A}$. E.g. in SKS
a rule x↓ is dual to x↑, while s and m are self-dual. The set of duals of a system S is
denoted $\bar S$. Notice that a rule is sound if and only if its dual is, by the law of contraposition.

The dual of a derivation Φ from A to B in S is the derivation $\bar\Phi$ from $\bar B$ to $\bar A$ in $\bar S$ obtained
by flipping Φ upside down and replacing each inference step with its dual.

Similarly we define the dual of a flow-rewriting rule to be the rule flipped upside-down,
replacing each node with its dual, and the dual of a rewriting system is just the set of its
duals. A rule/system is sound, terminating and/or confluent if and only if its dual is.

We are now ready to define our basic translation from Resolution derivations to deep 
inference, on which our simulation results in later sections will be based. 

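Definition 6.6 (Translation of Resolution derivations). We give the following translation
R of a Resolution step to a $\overline{\mathsf{KS}^+}$ derivation, where each formula follows from the one above
it by the rule indicated on the left:

$$
R : \ \mathsf{wk}\,\dfrac{\Gamma}{\Gamma \cup \Delta}
\ \longmapsto\
\begin{array}{ll}
 & \Gamma \\
\mathsf{w}{\downarrow} & \Gamma \vee \Delta \\
= & \Gamma \cup \Delta
\end{array}
\qquad\qquad
R : \ \mathsf{res}\,\dfrac{\Gamma \cup \{a\} \qquad \Delta \cup \{\bar a\}}{\Gamma \cup \Delta}
\ \longmapsto\
\begin{array}{ll}
 & (\Gamma \cup \{a\}) \wedge (\Delta \cup \{\bar a\}) \\
= & (\Gamma \vee a) \wedge (\Delta \vee \bar a) \\
2 \cdot \mathsf{s} & \Gamma \vee \Delta \vee (a \wedge \bar a) \\
\mathsf{ai}{\uparrow} & \Gamma \vee \Delta \\
(\mathsf{c}{\downarrow}) & \Gamma \cup \Delta
\end{array}
$$

where the parenthesised c↓ steps are present only when Γ and Δ are sets (not multisets)
that have nonempty intersection.

We extend the definition of R to any (multiset-)Resolution derivation $\pi = (\Delta_1, \dots, \Delta_n)$
from $\Gamma_1, \dots, \Gamma_s$, such that $R\pi$ has the following format (written horizontally, from premiss
on the left to conclusion on the right):

$$
\bigwedge_{r=1}^{s} \Gamma_r \;\xrightarrow{\ R\pi,\ \overline{\mathsf{KS}^+}\ }\; \bigwedge_{m=1}^{n} \Delta_m
$$

The definition is by induction on the length n of the Resolution derivation π.

If n = 0, i.e. π is an empty list, then $\bigwedge_m \Delta_m = \top$ and we define⁷

$$
R : \ \pi \ \longmapsto\ \mathsf{w}{\uparrow}\,\dfrac{\bigwedge_{r=1}^{s} \Gamma_r}{\top}
$$

If π is extended by a (multi)set $\Gamma_j$ then we define,

$$
R : \ \begin{array}{c} \Delta_1 \\ \vdots \\ \Delta_n \\ \Gamma_j \end{array}
\ \longmapsto\
\begin{array}{ll}
 & \bigwedge_{r=1}^{s} \Gamma_r \\
=,\ \mathsf{c}{\uparrow} & \bigwedge_{r=1}^{s} \Gamma_r \wedge \Gamma_j \\
\mathrm{ID} & \bigwedge_{m=1}^{n} \Delta_m \wedge \Gamma_j
\end{array}
$$

where the c↑ step duplicates $\Gamma_j$ and the derivation marked ID is already defined by induction.

If π is extended by a wk step then we define,

$$
R : \ \begin{array}{c} \Delta_1 \\ \vdots \\ \Delta_n \\ \Delta_i \cup \Sigma \end{array}
\ \longmapsto\
\begin{array}{ll}
 & \bigwedge_{r=1}^{s} \Gamma_r \\
\mathrm{ID} & \bigwedge_{m=1}^{n} \Delta_m \\
=,\ \mathsf{c}{\uparrow} & \bigwedge_{m=1}^{n} \Delta_m \wedge \Delta_i \\
R(\mathsf{wk}) & \bigwedge_{m=1}^{n} \Delta_m \wedge (\Delta_i \cup \Sigma) \\
= & \bigwedge_{m=1}^{n+1} \Delta_m
\end{array}
$$

where $\Delta_{n+1} = \Delta_i \cup \Sigma$ and the derivation marked ID is already defined by induction.

If π is extended by a res step then we define,

$$
R : \ \begin{array}{c} \Delta_1 \\ \vdots \\ \Delta_n \\ \Delta'_i \cup \Delta'_j \end{array}
\ \longmapsto\
\begin{array}{ll}
 & \bigwedge_{r=1}^{s} \Gamma_r \\
\mathrm{ID} & \bigwedge_{m=1}^{n} \Delta_m \\
=,\ 2 \cdot \mathsf{c}{\uparrow} & \bigwedge_{m=1}^{n} \Delta_m \wedge \Delta_i \wedge \Delta_j \\
R(\mathsf{res}) & \bigwedge_{m=1}^{n} \Delta_m \wedge (\Delta'_i \cup \Delta'_j) \\
= & \bigwedge_{m=1}^{n+1} \Delta_m
\end{array}
$$

where $\Delta_i = \Delta'_i \cup \{a\}$, $\Delta_j = \Delta'_j \cup \{\bar a\}$, $\Delta_{n+1} = \Delta'_i \cup \Delta'_j$ and the derivation marked ID is
already defined by induction.

⁷ Throughout this translation we shall omit certain = rules required to handle units, e.g. we associate the
empty disjunction with ⊥ and the empty conjunction with ⊤. By now this is a routine consideration.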
6.1. Multiset Resolution. We consider the multiset setting, for arbitrary derivations (i.e.
not necessarily tree-like). Here we achieve a polynomial simulation in KS rather easily by
using the identities pointed out in Not. 6.3.

Observation 6.7. The image of a multiset-Resolution derivation under R has no ac↓ steps.

Proof. The only ac↓ steps in the definition of R occur in the translation of individual res
steps, where the format of the premiss is Γ ∨ Δ and the conclusion Γ ∪ Δ. However, as
already pointed out in Not. 6.3, these are equivalent in the multiset setting, and so no ac↓
steps are necessary at all. □

Theorem 6.8. KS polynomially simulates multiset Resolution.

Proof. For a multiset Resolution refutation π of $\Gamma_1, \dots, \Gamma_s$ we have that $R\pi$ is a $\overline{\mathsf{KS}^+}$
derivation from $\bigwedge_{r=1}^{s} \Gamma_r$ to ⊥, by definition, not containing ac↓ steps, by the observation above.
Consequently we have that $\overline{R\pi}$ has the following format (written horizontally, from premiss
on the left to conclusion on the right),

$$
\top \;\xrightarrow{\ \overline{R\pi},\ \mathsf{KS} \cup \{\mathsf{aw}{\uparrow}\}\ }\; \bigvee_{r=1}^{s} \bar\Gamma_r
$$

whose flow has bounded length (due to the absence of ac↑ nodes) and so can be transformed
to a KS proof of the same conclusion in polynomial time by Prop. 3.10 and Thm. 3.8. □

6.2. Tree-like Resolution. We now consider Resolution derivations over sets and show
a polynomial simulation of tree-like refutations in KS. Here the argument is only slightly
more involved: there are both ac↓ and ac↑ steps occurring, but they do not interact in any
complex way.

We point out that the proof of this result could perhaps be carried out more simply by
writing tree-like derivations as trees and simulating steps locally, discarding premisses once
they are used. However the current presentation allows us to use the same translation R
for this and the multiset setting, as well as the extensions introduced in the next section.
This uniform treatment is possible due to Lemma 3.13 on the number of open ai-paths in
the absence of contraction loops.

Proposition 6.9. If π is a tree-like Resolution derivation then every ac↑ node in $fl(R\pi)$
has one edge pending.

Proof. By induction on the length of π. On inspection of the atomic flows, notice that each
translation step introduces only c↑ nodes whose lower left edge is pending (with respect to
the horizontal order of atoms given in the definition of R in Dfn. 6.6), so it suffices to show
that new nodes are not attached to the lower edge of existing c↑ nodes.

If any such situation existed then, by construction, each c↑ node must be associated
with the same set $\Delta_i$ that is the premiss of distinct steps in π, contradicting the fact that
π is tree-like. □
Recall the notion of contraction loop from Dfn. 3.12.

Corollary 6.10. If π is a tree-like Resolution derivation then $fl(R\pi)$ has no contraction
loops.

Proof. Any contraction loop would violate Prop. 6.9 above. □

Theorem 6.11. KS polynomially simulates tree-like Resolution.

Proof. For a tree-like Resolution refutation π of $\Gamma_1, \dots, \Gamma_s$, as in the proof of Thm. 6.8, we
have that $\overline{R\pi}$ has the format (written horizontally, from premiss on the left to conclusion
on the right):

$$
\top \;\xrightarrow{\ \overline{R\pi},\ \mathsf{KS}^+\ }\; \bigvee_{r=1}^{s} \bar\Gamma_r
$$

We also have that $fl(\overline{R\pi})$ contains no contraction loops, since otherwise there would also be
a contraction loop in $fl(R\pi)$, violating Cor. 6.10. The result now follows from Lemma 3.13
and Thm. 3.8. □


6.3. Extensions of Resolution. Finally, we notice that this simulation extends to some
extensions of Resolution, operating on (multi)sets of conjunctions of literals, introduced by
Krajíček in [Kra01]. These systems are known to be strictly stronger than their counterparts
we have so far dealt with [SBI02] [EGM04].

Definition 6.12. Let s, t, etc. denote terms, i.e. conjunctions of literals, and Γ, Δ, etc. now
denote sets of terms. We define the following rules,

$$
\mathsf{wk}\,\dfrac{\Gamma}{\Gamma \cup \Delta}
\qquad
\wedge\,\dfrac{\Gamma \cup \{s\} \qquad \Delta \cup \{t\}}{\Gamma \cup \Delta \cup \{s \wedge t\}}
\qquad
\mathsf{res}\,\dfrac{\Gamma \cup \{s\} \qquad \Delta \cup \{\bar a_1, \dots, \bar a_k\}}{\Gamma \cup \Delta}
$$

where in the res rule we require that $s = \bigwedge_{i=1}^{k} a_i$.

For a function $f : \mathbb{N} \to \mathbb{N}$, a (tree-like) (multiset-)Resolution(f) derivation/refutation is
defined analogously to Dfn. 6.1 using the rules above, with the additional proviso that no
term has size bigger than $f(N)$, where N is the number of inference steps in the refutation.

Theorem 6.13. For any $f : \mathbb{N} \to \mathbb{N}$ we have the following:

(1) KS polynomially simulates multiset-Resolution(f).

(2) KS polynomially simulates tree-like Resolution(f).

Proof sketch. The proofs are analogous to those appearing earlier for the multiset and
tree-like variants of Resolution respectively, interpreting Γ, Δ etc. as (multi)sets of terms and
replacing variables $a$, $\bar a$ etc. by term variables and their duals $t$, $\bar t$ etc.⁸

⁸ In light of the aforementioned abuse of notation identifying a (multi)set with the disjunction of its
elements, we associate the dual of the term $\bigwedge_{i=1}^{k} a_i$ with the (multi)set $\{\bar a_1, \dots, \bar a_k\}$.
We extend the definition of R on individual steps to ∧ steps as follows, where each formula
follows from the one above it by the rule indicated on the left:

$$
R : \ \wedge\,\dfrac{\Gamma \cup \{s\} \qquad \Delta \cup \{t\}}{\Gamma \cup \Delta \cup \{s \wedge t\}}
\ \longmapsto\
\begin{array}{ll}
 & (\Gamma \cup \{s\}) \wedge (\Delta \cup \{t\}) \\
= & (\Gamma \vee s) \wedge (\Delta \vee t) \\
2 \cdot \mathsf{s} & \Gamma \vee \Delta \vee (s \wedge t) \\
(\mathsf{c}{\downarrow}) & \Gamma \cup \Delta \cup \{s \wedge t\}
\end{array}
$$

R is extended to Resolution(f)-derivations similarly to Dfn. 6.6, dealing with ∧ steps in the
same way as res steps.

For (1) the argument is identical to that for Thm. 6.8 since, again, c↓ steps are not
introduced for the same reason, cf. Not. 6.3.

For (2) the argument is identical to that for Thm. 6.11 since the argument of Prop. 6.9
remains valid and so there are still no contraction loops in the flow of a derivation in the
image of R. □


7. Conclusions 

We have presented a series of upper bound results for the deep inference system KS.
Polynomial simulations were given for truth tables and versions of Resolution, while polynomial-size
proofs of functional and onto variants of the propositional pigeonhole principle were
presented, yielding exponential separations from all these systems, as well as bounded-depth
Frege systems.

We have seen that atomic flows can act as a powerful tool to analyse and manipulate 
derivations, and that often we can avoid the possibly exponential blowup arising from the 
c↓-c↑ rule. A relevant pursuit would be to investigate whether we can always avoid this
blowup via, perhaps, a polynomial-time local or global flow reduction; this would imply 
that KS polynomially simulates KS + , and thus quasipolynomially simulates SKS and Frege 
systems. Despite much work in this direction a result, positive or negative, seems difficult 
to obtain; this remains arguably the most important question in the proof complexity of 
deep inference. 

Question 7.1. Does KS (quasi)polynomially simulate KS + ? 

This question has already been asked in previous works, e.g. [BG09], [Jer09], [Str12]
and [Das with both positive and negative answers conjectured. We point out that our 
contribution might be helpful to any work towards a positive answer. 

We also point out that it could be that atomic flows do not themselves include sufficient 
information to carry out an efficient normalisation procedure from KS + to KS, if one exists 
at all. To this end there is work ongoing by various researchers to augment atomic flows 
with certain logical information, in the hope of accessing further normalisation procedures 
and the ability to ‘rebuild’ deep inference proofs efficiently from flow objects.⁹

One might argue that our proofs in Sects. 4 and 5 eventually reduced to flows of
bounded length, and so the complexity of normalisation was trivially polynomial. A more
sophisticated situation might involve flows of bounded width and logarithmic length, again
resulting in a polynomial blowup, or quasipolynomial width and polylogarithmic length,
giving a quasipolynomial blowup. We refer the reader to the recent article [Das14] for an
example of this, where quasipolynomial-size proofs of the unrestricted pigeonhole principle
are given in KS, utilising the techniques of this paper.

⁹ This is known to be impossible in polynomial time, by [Das13], under certain hardness assumptions.

All the results that appear in this work, and indeed all other works on proof complexity
of deep inference, e.g. [BG09], [Jer09], [Das11], present only simulations and upper bounds.
We currently know of no technique for proving lower bounds for deep inference systems
and, in light of the question above and results in [Jer09] and [Das14], it may be that such
an endeavour, even for KS, could be as difficult as that of proving lower bounds for Frege
systems.